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Phonology, as it is practiced, is deeply computational. Phonological analy- 
sis is data-intensive and the resulting models are nothing other than specialized 
data structures and algorithms. In the past, phonological computation - managing 
data and developing analyses - was done manually with pencil and paper. In- 
creasingly, with the proliferation of affordable computers, IPA fonts and drawing 
software, phonologists are seeking to move their computation work online. Com- 
putational Phonology provides the theoretical and technological framework for 
this migration, building on methodologies and tools from computational linguis- 
tics. This piece consists of an apology for computational phonology, a history, 
and an overview of current research. 

Documentation and Description. Phonological data is of essentially three 
types: texts, wordlists and paradigms. A text is any phonetically transcribed nar- 
rative or conversation. A wordlist is any compilation of linguistic forms which 
can be uttered in isolation, with information about pronunciation and meaning. A 
paradigm is broadly construed to mean any tabulation of words or phrases which 
illustrates contrasts and systematic variation. Any of these data types may be an- 
notated with more abstract information originating from a phonological theory, 
such as syllable boundaries, stress marks and prosodic structure. Additionally, 
any of these data types may be associated with recordings of audio, video or 
physiological signals. Digitizing this documentation and description brings all 
the different media types together, makes the cross-links navigable, and opens up 
many new possibilities for management, access and preservation. 

Exploration and Analysis. The data types described above are closely inter- 
connected in phonological practice. For instance, the discovery of a new word 
in a text may require an update to the lexicon and the construction of a new 
paradigm (e.g. to correctly classify the word). Fresh insights may lead to new 
annotations and further elicitation, closing the loop in this perpetual, exploratory 
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process. Phonological analysis typically involves defining a formal model, sys- 
tematically testing it against data, and comparing it with other models. (In some 
cases, the model may be incorporated into a software system, e.g. for generating 
natural intonation in a text-to-speech system.) In this exploration and analysis - 
sorting, searching, tabulating, defining, testing and comparing - the principal task 
is computational. 

Perhaps the earliest work in computational phonology was Bobrow and Fraser's 
Phonological Rule Tester ( Pobrow and Fraser, 1968D , an implementation of SPE 
designed to "alleviate the problem of rule evaluation." Shortly afterwards Johnson 
showed that, while SPE rules resemble general rewriting systems at the top of the 
Chomsky hierarchy, the way SPE rules are used in practice only requires finite 
state power ( pohnson, 1972| ). Independently, Kaplan and Kay discovered the con- 
nections between SPE grammars and finite state transducers in the 70's and 80's, 
and laid down a complete algebraic foundation (ultimately reported in (Kaplan 
and Kay, 1994)). Significant implementations followed, including ( Koskenniemi, 
1983; [Beesley and Karttunen, 2002D . Attempts to apply finite state devices to 
Autosegmental Phonology have largely foundered, but applications to Optimality 
Theory are thriving. 

While finite-state phonology fixated on SPE, generative phonology contin- 
ued its rapid evolution. The discovery of rule "conspiracies" ( [Kisseberth, 1970| ) 
and the abstractness controversy ( pCoutsoudas et al., 1974| ), lead to calls for the 
reintroduction of surface structure constraints. Many theories arose from the fall- 
out; most notable for its computational ramifications was Montague Phonology 
( [Wheeler, 198 1] ). This model adapted new lexicalist formalisms from syntax and 
semantics, providing a declarative (as opposed to procedural) account of phono- 
logical well-formedness, and providing the first computational account of under- 
specification (where the phonological content of a lexical entry is incompletely 
specified, to be filled in during a derivation). From these beginnings. Declarative 
Phonology was bom, and subsequent work provided a mathematical foundation 
in first-order logic ( [Bird, 19951 ) phonetic interpretation with links to Firthian 
prosodic analysis and speech synthesis ( Poleman, 1997] ), with implementations 
generally in the Prolog programming language. 

A third major strand of development, complementing the finite state and declar- 
ative models, is best characterized as statistical. It seeks to apply neural networks, 
information theory, and weighted automata in the automatic discovery of phono- 
logical information. Gasser trained a recurrent neural network to recognize syl- 
lables and to repair ill-formed syllables ( passer, 1992[ ). Ellison showed how a 
technique from information theory called MDL - minimum description length - 
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could be applied to automatically identify syllable boundaries in phonemically 
transcribed texts ( [Ellison, 1992| ). Many researchers apply Markov models (a kind 
of weighted automata) in speech recognition, mapping speech recordings to pho- 
netic transcriptions and thence to orthographic words, using large, phonetically 
annotated corpora as training data (e.g. TIMIT ( parofolo et al., 198^ )). 

Four key areas of ongoing research in computational phonology are in Opti- 
mality Theory, automatic learning, interfaces to grammar and phonetics, and sup- 
porting phonological description in the field. Comprehensive references to online 
research papers in this areas may be found on the SIGPHON website. 

Computational phonology is generating sophisticated and rigorous ways for 
creating, exploring and disseminating multidimensional phonological informa- 
tion, encompassing primary recordings, texts, wordlists, paradigms, theories and 
analyses. As phonologists adopt the computational methods described above, ex- 
tending and adapting them as needed, the consequences for the discipline will be 
increased accessibility, accountability, and stability of empirical research. 

Resources. The Association for Computational Linguistics (ACL) has a spe- 
cial interest group in computational phonology (SIGPHON) with a homepage at 
http : / / www .cogsci.ed.ac.uk/ sigphon/[ The website contains on- 
line proceedings for SIGPHON workshops and information about relevant books, 
dissertations and articles. A special issue of Computational Linguistics devoted to 
computational phonology was published in 1994 ( [Bird, 199^ . 
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